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Abstract 

This paper formulates a model of utility for a continuous time framework 
that captures the decision-maker's concern with ambiguity about both the 
drift and volatility of the driving process. At a technical level, the analy- 
sis requires a significant departure from existing continuous time modeling 
because it cannot be done within a probability space framework. This is 
fSJ ■ because ambiguity about volatility leads invariably to a set of nonequivalent 



priors, that is, to priors that disagree about which scenarios are possible. 
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1. Introduction 



This paper formulates a model of utility for a continuous time framework that 
captures the decision-maker's concern with ambiguity or model uncertainty. The 
paper's novelty lies in the range of model uncertainty that is accommodated. 
Specifically, aversion to ambiguity about both drift and volatility is captured. 
At a technical level, the analysis requires a significant departure from existing 
continuous time modeling because it cannot be done within a probability space 
framework. This is because ambiguity about volatility leads invariably to an 
undominated set of priors. In fact, priors are typically nonequivalent (not mutually 
absolutely continuous) - they disagree about which scenarios are possible. 

The model of utility is a continuous time version of multiple priors (or maxmin) 
utility formulated by Gilboa and Schmeidler [13] for a static setting. Related 
continuous time models are provided by Chen and Epstein [3] and also Hansen, 
Sargent and coauthors (see Anderson et al. [1], for example). 1 In these papers, 
ambiguity is modeled so as to retain the property that all priors are equivalent. 
This universal restriction is driven by the technical demands of continuous time 
modeling, specifically by the need to work within a probability space framework. 
Notably, in order to describe ambiguity authors invariably rely on Girsanov's the- 
orem for changing measures. It provides a tractable characterization of alternative 
hypotheses about the true probability law, but it also limits alternative hypothe- 
ses to correspond to measures that are both mutually equivalent and that differ 
from one another only in what they imply about drift. This paper defines a more 
general framework within which one can model the utility of an individual who 
is not completely confident in any single probability law for volatility or in which 
future events are possible. 

Economic motivation for our model, some applications to asset pricing, and 
also an informal intuitive outline of the construction of utility (without proofs), are 
provided in a companion paper Epstein and Ji [8]. 2 The present paper provides 
a mathematically rigorous treatment of utility. Thus, for example, it provides 
the foundations for an equilibrium analysis of asset markets in the presence of 
ambiguous volatility. The reader is referred to [8] for further discussion of the 

1 The discrete time counterpart of the former is axiomatized in Epstein and Schneider [9]. 

2 Economic motivation is provided in part by the importance of stochastic volatility modeling 
in both financial economics and macroeconomics, the evidence that the dynamics of volatility are 
complicated and difficult to pin down empirically, and the presumption that complete confidence 
in any single parametric specification is often unwarranted and implausible. 
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framework and its economic rationale. 3 

The challenge in formulating the model is that it cannot be done within a 
probability space framework. 4 Typically, the ambient framework is a probabil- 
ity space (Q,P ), it is assumed that B = (B t ) is a Brownian motion under P , 
and importantly, Po is used to define null events. Thus random variables and 
stochastic processes are defined only up to the P -almost sure qualification and 
Po is an essential part of the definition of all formal domains. However, ambiguity 
about volatility implies that nullity (or possibility) cannot be defined by any sin- 
gle probability measure. This is easily illustrated. Let B be a Brownian motion 
under P an d denote by P- and P a the probability distributions over continuous 
paths induced by Po and the two processes {g_B t ) and (aB t ), where, for simplicity, 
a and a are constants. Then P- and P CT are mutually singular (and hence not 
equivalent) because 

PH{(B)t = g_ 2 T}) = 1 = P W {{{B) T = a 2 T}). (1.1) 

To overcome the resulting difficulty, we define appropriate domains of stochastic 
processes by using the entire set of priors to define the almost sure qualification. 
For example, equality of two random variables will be taken to mean almost sure 
equality for every prior in the decision maker's set of priors. This so-called qua- 
sisure stochastic analysis was developed by Denis and Martini [6]. See also Soner 
et al. [33] and Denis et al. [4] for elaboration on why a probability space framework 
is inadequate and for a comparison of quasisure analysis with related approaches. 5 
Prominent among the latter is the seminal contribution of G-expectation due to 
Peng [28, 30, 31, 32], wherein a nonlinear expectations operator is defined by a 
set of undominated measures. We combine elements of both quasisure analysis 
and G-expectation. Conditioning, or updating, is obviously a crucial ingredient in 
modeling dynamic preferences. In this respect we adapt the approach in Soner et 

3 For example, an occasional reaction is that ambiguity about volatility is implausible because 
one can estimate the law of motion for volatility (of asset prices, for example) extremely well. 
However, this perspective presumes a stationary environment and relies on a tight connection 
between the past and future that we relax. For similar reasons we reject the suggestion that 
one can discriminate readily between nonequivalent laws of motion and hence that there is no 
loss of empirical relevance in restricting priors to be equivalent. See the companion paper for 
elaboration on these and other matters of interpretation. 

4 Where the set of priors is finite (or countable), a dominating measure is easily constructed. 
However, the set of priors in our model is not countable, and a suitable probability space 
framework does not exist outside of the extreme case where there is no ambiguity about volatility. 

5 Related developments are provided by Bion-Nadal et al. [2] and Soner et al. [34, 35]. 
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al. [36] and Nutz [25] to conditioning undominated sets of measures. 6 However, 
these analyses do not apply off-the-shelf because, for example, they permit am- 
biguity about volatility but not about drift. In particular, accommodating both 
kinds of ambiguity necessitates a novel construction of the set of priors. 

Besides those already mentioned, there are only a few relevant papers in the 
literature on continuous time utility theory. Denis and Kervarec [5] formulate mul- 
tiple priors utility functions and study optimization in a continuous-time frame- 
work; they do not assume equivalence of measures but they restrict attention to 
the case where only terminal consumption matters and where all decisions are 
made at a single ex ante stage. Bion-Nadal and Kervarec [2] study risk measures 
(which can be viewed as a form of utility function) in the absence of certainty 
about what is possible. 

Section 2 is the heart of the paper and presents the model of utility, begin- 
ning with the construction of the set of priors and the definition of (nonlinear) 
conditional expectation. Proofs are collected in appendices. 

2. Utility 

2.1. Preliminaries 

Time t varies over the finite horizon [0,T]. Paths or trajectories of the driv- 
ing process are assumed to be continuous and thus are modeled by elements of 
C d ([0,T]), the set of all Revalued continuous functions on [0, T], endowed with 
the sup norm. The generic path is ui = (u> t )te[o,T], where we write u t instead of 
to (t). All relevant paths begin at and thus we define the canonical state space 
to be 

n = {co = (cu t )eC d ([0,T]):co = 0}. 

The coordinate process {B t ), where B t (u) = uu t , is denoted by B. Information 
is modeled by the filtration J 7 = {J-t} generated by B. Let Pq be the Wiener 
measure on fl so that B is a Brownian motion under Pq. It is a reference measure 
only in the mathematical sense of facilitating the description of the individual's 
set of priors; but the latter need not contain P . 

Define also the set of paths over the time interval [0, £]: 

*fi = = (*w s ) G C d {[0,t]) :* u = 0} . 
6 Peng [28, 29] provides a related approach to conditioning. 
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Identify *f2 with a subspace of f2 by identifying any t u with the function on [0, T] 
that is constant at level t u t on [t, T\. Note that the filtration Ti is the Borel a- field 
on (Below, for any topological space we always adopt the Borel a-field even 
where not mentioned explicitly.) 

Consumption processes c take values in C, a convex subset of M. e . The domain 
of consumption processes is denoted D. Because we are interested in describing 
dynamic choice, we need to specify not only ex ante utility over D, but a suitable 
process of (conditional) utility functions. 

The key is construction of the set of priors. The primitive is the individual's 
hypotheses about drift and volatility. These are used to specify the set of priors. 
Conditioning is treated next. Finally, these components are applied to define a 
recursive process of utility functions. 

2.2. Drift and Volatility Hypotheses 

Before moving to the general setup, we outline briefly a special case where d — 1 
and there is ambiguity only about volatility. Accordingly, suppose that (speak- 
ing informally) the individual is certain that the driving process B = (B t ) is a 
martingale, but that its volatility is known only up to the interval [a, a]. In par- 
ticular, she does not take a stand on any particular parametric model of volatility 
dynamics. 

To be more precise about the meaning of volatility, recall that the quadratic 
variation process of (B t ) is defined by 



where = ti < . . . < t n — t and At k = t k+1 — t k . (By Follmer [11] and Karandikar 
[19], the above limit exists almost surely for every measure that makes B a mar- 
tingale, thus giving a universal stochastic process (B); because the individual is 
certain that B is a martingale, this limited universality is all we need.) Then the 
volatility (a t ) of B is defined by 



d{B) t = a]dt. 

Therefore, the interval constraint on volatility can be written also in the form 



The model that follows is much more general. Importantly, the interval [a, a] 
can be time and state varying, and the dependence on history of the interval at 




S \B tk+1 (u)-B tk (u) 

t k <t 



(2.1) 



< (B) t < ah. 



(2.2) 
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time t is unrestricted, thus permitting any model of how ambiguity varies with 
observation (that is, learning) to be accommodated. In addition, the model admits 
multidimensional driving processes (d > 1) and also ambiguity about drift, thus 
relaxing the assumption of certainty that B is a martingale. 

In general, the individual is not certain that the driving process has zero drift 
and/or unit variance. Accordingly, she entertains a range of alternative hypotheses 
X e = (Xf) parametrized by 9 = (9 t ). Here 9 t = (//*, cr t ) is an ^-progressively 
measurable process with values in R d x R dxd that describes a conceivable process 
for drift jj, = (/it) and for volatility a = (at). The primitive is the process of 
correspondences (Qt), where, for each t, 

Q t :Q^R d x R dxd . 

Roughly, t (oj) gives the set of admissible drift and volatility pairs at t along the 
trajectory u. The idea is that each 9 parametrizes the driving process X e = (Xf) 
given by the unique solution to the following stochastic differential equation (SDE) 
under P : 

dXf = fi t (X e )dt + a t (X e )dB t , X e = 0, t G [0, T]. (2.3) 

We assume that only #'s for which a unique strong solution exists are adopted as 
hypotheses. Therefore, denote by Q SDE the set of all processes 9 that ensure a 
unique strong solution X e to the SDE, 7 and define the set O of admissible drift 
and volatility processes by 

= [e e Q SDE : 9 t (ou) e Q t (to) for all (t, u) G [0, T] x Q} . (2.4) 

We impose the following technical regularity conditions on (0 t ): 

(i) Measurability: The correspondence (t, oo) i — > @t(w) on [0, s] xQ is B([0, s]) x 
J-'s-measurable for every < s < T. 

(ii) Uniform Boundedness: There is a compact subset K, in M. d x M. dxd such that 
G t : -w /C each t. 

(iii) Compact- Convex: Each Q t is compact-valued and convex- valued. 

(iv) Uniform Nondegeneracy: There exists a, a dxd real-valued positive definite 
matrix, such that for every t and u, if (/i t , cr t ) G @t(w), then a t aj > a. 

7 By uniqueness we mean that Po ({sup 0<t<T | X® — X[ |> 0}) = for any other strong 
solution (X' t ). A Lipschitz condition and boundedness are sufficient for existence of a unique 
solution, but these properties are not necessary and a Lipschitz condition is not imposed. 
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(v) Uniform Continuity: The process (Q t ) is uniformly continuous in the sense 
defined in Appendix A. 

(vi) Uniform Interiority: There exists 5 > such that ri 5 Q t (u) ^ for all 
t and oj, where ri s Q t (uj) is the 5-relative interior of Ot(w). (For any D C 
(R d x R dxd ) and 5 > 0, ri*D e{i£D:(i + B 5 (x)) n (aff £>) C D}, where 
aff L> is the affine hull of D and B§(x) denotes the open ball of radius 5.) 

(vii) Uniform Affine Hull: The affine hulls of t /(u/) and @i(w) are the same for 
every (f,u r ) and (t,u) in [0,T] x Q. 

Conditions (i)-(iii) parallel assumptions made by Chen and Epstein [3]. A form 
of Nondegeneracy is standard in financial economics. The remaining conditions 
are adapted from Nutz [25] and are imposed in order to accommodate ambiguity 
in volatility. The major differences from Nutz' assumptions are in (vi) and (vii). 
Translated into our setting, he assumes that int d Q t (uu) ^ 0, where, for any D, 
int s Q t (u) = {x E D : (x + B$(x)) C D}. By weakening his requirement to 
deal with relative interiors, we are able to broaden the scope of the model in 
important ways (see the first three examples below). Because each ©t(w) is convex, 
if it also has nonempty interior then its affine hull is all of M. d x M. dxd . Then 
ri 5 Q t (uj) = int s Q t (u) ^ and also (vii) is implied. In this sense, (vi)-(vii) are 
jointly weaker than assuming int 5 Q t (u) ^ 0- 

We illustrate the scope of the model through some examples. 

Example 2.1 (Ambiguous drift). If@ t (u) cM. d x {a t (w)} for every t and ui, 
for some volatility process a, then there is ambiguity only about drift. If d = 1, 
it is modeled by the random and time varying interval [/i t ,/Z t ]. The regularity 
conditions above for (0 t ) are satisfied if: of > a > and ~p t —fit > everywhere, 
and if~p, t and \it are continuous in u uniformly in t. This special case corresponds 
to the Chen and Epstein [3] model. 

Example 2.2 (Ambiguous volatility). If Q t (uj) C {/i t (uj)} x R dxd for every 
t and u, for some drift process fi, then there is ambiguity only about volatility. 
If d — 1, it is modeled by the random and time varying interval [gj,a t }. The 
regularity conditions for (Q t ) are satisfied if: a t >g_ t > a > everywhere, and if 
a t and q_ t are continuous in ou uniformly in t. 

A generalization is important. Allow d > 1 and let /j, t — 0. Then (speaking 
informally) there is certainty that B is a martingale in spite of uncertainty about 
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the true probability law. Volatility (o t ) is a process of d x d matrices. Let the 
admissible volatility processes (cr t ) be those satisfying o t G T, where F is any 
compact convex subset of M. dxd such that, for all o in T, aa T > a for some 
positive definite matrix a. 8 This specification is essentially equivalent to Peng's 
[30] notion of G-Brownian motion. 9 

Example 2.3 (Robust stochastic volatility). This is a special case of the pre- 
ceding example but we describe it separately in order to highlight the connection 
of our model to the stochastic volatility literature. By a stochastic volatility model 
we mean the hypothesis that the driving process has zero drift and that its volatil- 
ity is stochastic and is described by a single process (cr t ) satisfying regularity con- 
ditions of the sort given above. The specification of a single process for volatility 
indicates the individual's complete confidence in the implied dynamics. Suppose, 
however, that (a}) and (of) describe two alternative stochastic volatility models 
that are put forth by expert econometricians; 10 for instance, they might conform 
to the Hull and White [17] and Heston [15] parametric forms respectively. The 
models have comparable empirical credentials and are not easily distinguished em- 
pirically, but their implications for optimal choice (or for the pricing of derivative 
securities, which is a context in which stochastic volatility models are used heav- 
ily) differ significantly. Faced with these two models, the individual might place 
probability | on each being the true model. But why should she be certain that 
either one is true? Both [a]) and (of) may fit data well to some approximation, 
but other approximating models may do as well. An intermediate model such as 
{\ a t + \ a t) JS one alternative, but there are many others that "lie between" [a]) 
and (of) and that plausibly should be taken into account. Accordingly, (assuming 
d=l), let 

q_ t (u>) = min-fo* (to) , a\ (to)} and a t (u) = max{of (to) , o\ (to)}, 

and admit all volatility processes with values lying in the interval [q_ t (uj) , o t (a;)] 
for every u. 

Example 2.4 (Joint ambiguity). The model is flexible in the way it relates 
ambiguity about drift and ambiguity about volatility. For example, a form of 

8 Given symmetric matrices A' and A, A' > A if A' — A is positive semidefinite. 
9 Peng provides generalizations of Ito's Lemma and Ito integration appropriate for G- 
Brownian motion that we exploit in our companion paper in deriving asset pricing implications. 
10 There is an obvious extension to any finite number of models. 



8 



independence is modeled if (taking d — 1) 



(2.5) 



the Cartesian product of the intervals described in the preceding two examples. 

An alternative hypothesis is that drift and volatility are thought to move 
together. This is captured by specifying, for example, 



Q t (u) = {(//, a) E R 2 : n = /i min + z, a 2 = a 2 nin + 2z/ 7 , 0<z<z t (u>)}, (2.6) 



where /x m i n , o"min aD d 7 > are fixed and known parameters. The regularity 
conditions for (0 t ) are satished if It is positive everywhere and continuous in u 
uniformly in t. This specification is adapted from Epstein and Schneider [10]. 

Example 2.5 (Markovian ambiguity). Assume that (Q t ) satisfies: 



Then ambiguity depends only on the current state and not on history. Note, 
however, that according to (2.4), the drift and volatility processes deemed possible 
are not necessarily Markovian - 8 t can depend on the complete history at any time. 
Thus the individual is not certain that the driving process is Markovian, but the 
set of processes that she considers possible at any given time is independent of 
history beyond the prevailing state. 

2.3. Priors, expectation and conditional expectation 

We proceed to translate the set of hypotheses about drift and volatility into a 
set of priors. Each 9 induces (via P ) a probability measure P e on (Q, J 7 ^) given 



Fix and denote the set of priors V simply by V . This is the set of priors 
used, as in the Gilboa-Schmeidler model, to define utility and to describe choice 
between consumption processes. 11 

11 The set V is relatively compact in the topology induced by bounded continuous functions 
(this is a direct consequence of Gihman and Skorohod [12, Theorem 3.10]). 



Q t (o/) = 6 t (w). 




(2.7) 
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Remark 1. If all alternative hypotheses display unit variance, then ambiguity 
is limited to the drift as in the Chen-Epstein model, and one can show that, as 
in [3], measures in V are pairwise equivalent. At the other extreme, if they all 
display zero drift, then ambiguity is limited to volatility and many measures in 
V are mutually singular. Nonequivalence of priors prevails in the general model 
when both drift and volatility are ambiguous. 

Given V, we define (nonlinear) expectation as follows. For a random variable 
£ on (fl, J~t), if sup Pg p Ep^ < oo define 



Because we will assume that the individual is concerned with worst-case scenarios, 
below we use the fact that 



The crucial remaining ingredient of the model, and the focus of most of the 
work in the appendices, is conditioning. A naive approach to defining conditional 
expectation would be to use the standard conditional expectation Ep[£ | J^] for 
each P in V and then to take the (essential) supremum over V . Such an approach 
immediately encounters a roadblock due to the nonequivalence of priors. The 
conditional expectation Ep[^ \ J-" t ] is well defined only P-almost surely, while to 
be a meaningful object for analysis, a random variable must be well defined from 
the perspective of every measure in V . In the following, we say that a property 
holds quasisurely (q.s. for short) if it holds P-a.s. for every P G V. 12 In other 
words, and speaking very informally, conditional beliefs must be defined at every 
node deemed possible by some measure in V. The economic rationale is that 
even if P (A) = 0, for some AGTj and t > 0, if also Q (A) > for some other 
prior in V, then ex ante the individual does not totally dismiss the possibility 
of A occurring when she formulates consumption plans: if she is guided by the 
worst case scenario, then minp/ G -p P' {p\A) < 1 implies that she would reject a 

12 Throughout, when Z is a random variable, Z > quasisurely means that the inequality is 
valid P-a.s. for every P in P. If Z = (Z t ) is a process, by the statement a Z t > for every t 
quasisurely (q.s.)" we mean that for every t there exists Gt C fl such that Z t (to) > for all 
lu eG t and P (G t ) = 1 for all P in P. If Z t = for every t quasisurely, then Z = in M 2 (0, T) 



(because E[ / Z t | 2 dt] < / E[\ Z t \ 2 }dt), but the converse is not valid in general. 



E£ = sup E P £. 



(2.8) 



vatEpZ = -E[-Z\. 
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bet against A that promised a sufficiently poor prize (low consumption stream) if 
A occurs. Therefore, a model of dynamic choice by a sophisticated and forward- 
looking individual should specify her consumption plan contingent on arriving at 
(A,t). 

This difficulty can be overcome because for every admissible hypothesis 9, 
9 t (u>) is defined for every (t,u), that is, the primitives specify a hypothesized 
instantaneous drift-volatility pair everywhere in the tree. This feature of the 
model resembles the approach adopted in the theory of extensive form games, 
namely the use of conditional probability systems, whereby conditional beliefs at 
every node are specified as primitives, obviating the need to update. It resembles 
also the approach in the discrete time model in Epstein and Schneider [9], where 
roughly, conditional beliefs about the next instant for every time and history are 
adopted as primitives and are pasted together by backward induction to deliver 
the ex ante set of priors. 

To proceed, recall the construction of the set of priors through (2.3) and the set 
6 of admissible drift and volatility processes. If 9 = (9 S ) is a conceivable scenario 
ex ante, then (9 S (*o>, -))t< s <T is seen by the individual ex ante as a conceivable 
continuation from time t along the history u. We assume that then it is also 
a conceivable scenario ex post conditionally on (t,u), thus ruling out surprises 
or unanticipated changes in outlook. Accordingly, X 8 ^ = (Xf Aw ) t < s <<r is a 
conceivable conditional scenario for the driving process if it solves the following 
SDE under P : 

f dX 6 />" = fi s (X e ^)ds + a s (X e ^)dB s , t<s<T 

\ X e />« =u s ,0<s<t. {l ^> 

The solution X 8 ^ induces a probability measure P%' u G A (Q) , denoted simply 
by with 9 suppressed when it is understood that P = P e . For each P in V, 
the measure Pf G A (Q) is defined for every t and u, and, importantly, it is a 
version of the regular ^-conditional probability of P (see Lemma B.2). 

The set of all such conditionals obtained as 9 varies over 9 is denoted 7^, that 

is, 

V? = {Pf '■ P G V} . (2.10) 
We take Vf to be the individual's set of priors conditional on (t,u). 13 

13 The evident parallel with the earlier construction of the ex ante set V can be expressed 
more formally because the construction via (2.9) can be expressed in terms of a process of 
correspondences (0*' w ) t<s<T , 0*' w : C d {[t,T}) R d x R dxd , satisfying counterparts of the 
regularity conditions (i)-(vii) on the time interval [t,T]. 
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The sets of conditionals in (2.10) lead to the following (nonlinear) conditional 
expectation on UC b (Q), the set of all bounded and uniformly continuous functions 
on f2: 

£[£ | F t ] (u) = sup E P £, for every £ G UC b (Q) and (t,u) E [0,T] x Q. (2.11) 

Conditional expectation is defined thereby on U C b (f2) , but this domain is not large 
enough for our purposes. 14 For example, the conditional expectation of £ need not 
be (uniformly) continuous in u even if £ is bounded and uniformly continuous, 
which is an obstacle to dealing with stochastic processes and recursive modeling. 
(Similarly, if one were to use the space C b (Q) of bounded continuous functions.) 

Thus we consider the larger domain L 2 (£l), the completion of UC b (Q) under 
the norm || £ ||= (E[\ £ | 2 ]) 5 - 15 Denis et al. [4] show that a random variable 
£ defined on Q lies in L 2 (Q) if and only if: (i) £ is quasicontinuous - for ev- 
ery e > there exists an open set G C f2 with P (G) < e for every P in V 
such that £ is continuous on Q\G; and (ii) £ is uniformly integrable in the sense 
that lim n _ i , 00 sup Pg7 , E p (| £ | 2 l{|g| >n }) = 0. This characterization is inspired by 
Lusin's Theorem for the classical case which implies that when V = {P}, then 
L 2 (f2) reduces to the familiar space of P-squared integrable random variables. For 
general V, L 2 (Q) is a proper subset of the set of measurable random variables £ 
for which sup PeV E p (| £ | 2 ) < oo. 16 However, it is large in the sense of containing 
many discontinuous random variables; for example, L 2 (Q) contains every bounded 
and lower semicontinuous function on Q (see the proof of Lemma B.8). 

Another aspect of L 2 (Q) warrants emphasis. Two random variables £' and £ 
are identified in L 2 (Q) if and only if || — £ ||= 0, which means that £' = £ almost 
surely with respect to P for every P in V . In that case, say that the equality 
obtains quasisurely and write £' = £ qs. Thus £' and £ are distinguished whenever 
they differ with positive probability for some measure in V. Accordingly, the space 
L 2 (VL) provides a more detailed picture of random variables than does any single 
measure in V . 

14 Thc definition is restricted to UCb (fi) in order to ensure the measurability of E[£ | Ft] (•) 
and proof of a suitable form of the law of iterated expectations. Indeed, Appendix B, specifically 
(B.2), shows that conditional expectation has a different representation when random variables 
outside U Cb (£1) are considered. 

15 It coincides with the completion of Cb (fi); see [4]. 

16 For example, if V is the set of all Dirac measures with support in fi, then L 2 (fi) = Cb (fi). 
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The next theorem (proven in Appendix B) shows that conditional expectation 
admits a suitably unique and well behaved extension from UCb (fi) to all of L 2 {VL). 
Accordingly, the sets determine the conditional expectation for all random 
variables considered in the sequel. 

Theorem 2.6 (Conditioning). The mapping E[- \ Ft] on UCb(Q) defined in 
(2.11) can be extended uniquely to a 1-Lipschitz continuous mapping E[- | Ft] : 
L 2 (Q) — > L 2 (*f2), where 1-Lipschitz continuity means that 

ii Eg i T t ] - m i n b<n r - 1 b for a n ^ e T^n). 

Moreover, the extension satisfies, for all £ and r\ in L 2 (Vt) and for all t e [0, T], 

E[E[£ | F t ] | F s ] =E[£\F a ], for0<s<t< T, (2.12) 

and: 

(i) If^>V, thenE[Z\T t }>E[n\T t }. 

(ii) If £ is Tt-me&sur&ble, then E[£ \ F t ] = £. 

(in) E[£ \ Tt] + E[r) \ F t ] > E[£ + 77 I .7^] with equality if rj is ^-measurable, 
(iv) E[r)£ I Fj] = i] + E[^ I F t ] + ^"i^— £ I F t ], if 77 is F t - measurable. 

The Lipschitz property is familiar from the classical case of a single prior, 
where it is implied by Jensen's inequality (see the proof of Lemma B.8). The law 
of iterated expectations (2.12) is intimately tied to dynamic consistency of the 
preferences discussed below. The nonlinearity expressed in (iii) reflects the nons- 
ingleton nature of the set of priors. Other properties have clear interpretations. 

2.4. The definition of utility 

Because we turn to consideration of processes, we define M 2 '°(0,T), the class of 
processes 77 of the form 

JV-l 

Vt(u) = ^&(w)l Mi+1 )(t), 

i=0 

where & G L^ft), < % < N - 1, and = t < • • • < t N = T. 17 Roughly, each 
such rj is a step function in random variables from the spaces L 2 ( U VL). For the 

17 The space 4i f2 was defined in Section 2.1. 
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usual technical reasons, we consider also suitable limits of such processes. Thus 
define M 2 (0,T) to be the completion of M 2 '°(0,T) under the norm 



Consumption at every time takes values in C, a convex subset of Con- 
sumption processes c = (c t ) lie in D, a subset of M 2 (0,T). 

For each c in D, we define a utility process (V t (c)), where V t (c) is the utility of 
the continuation {c s ) 0<s<t and Vq (c) is the utility of the entire process c. We often 
suppress the dependence on c and write simply (V t ). We define utility following 
Duffie and Epstein [7]. This is done in order that our model retain the flexibility 
to partially separate intertemporal substitution from other aspects of preference 
(here uncertainty, rather than risk by which we mean 'probabilistic uncertainty'). 

Let G and V = V @ be as above. The other primitive component is the 
aggregator f : C x R 1 — > R 1 . It is assumed to satisfy: 

(i) / is Borel measurable. 

(ii) Uniform Lipschitz for aggregator: There exists a positive constant K such 
that 

| f(c,v') - f(c,v) |< K | v' -v | , for all (c,v',v) G C x R 2 . 

(iii) (f(ct,v))o< t <T e M 2 (0,T) for each v G R and c G D. 

We define V t by 



This definition of utility generalizes both Duffie and Epstein [7], where there is 
no ambiguity, and Chen and Epstein [3], where ambiguity is confined to drift. 
Formally, they use different sets of priors: a singleton set in the former paper and 
a suitable set of equivalent priors in the latter paper. 
Our main result follows (see Appendix C for a proof). 

Theorem 2.7 (Utility). Let (0 t ) and f satisfy the above assumptions. Fix 
c G D. Then: 

(a) There exists a unique process (V t ) in M 2 (0,T) solving (2.13). 

(h) The process (V t ) is the unique solution in M 2 (0,T) to V T = and 




V t =- 



E[-tff( Cs ,V s )ds\T t ] . 



(2.13) 




(2.14) 
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Part (a) proves that utility is well defined by (2.13). Recursivity is established 
in (b). 

The most commonly used aggregator has the form 

f(c t ,v)=u(c t )-(3v, f3>0, (2.15) 
in which case utility admits the closed-form expression 

V t = -E[-J u(c s )e-P s ds\F t ]. (2.16) 

More generally, closed form expressions are rare. The following example illus- 
trates the effect of volatility ambiguity. 

Example 2.8 (Closed form). Consider the consumption process c satisfying 
(under P ) 

dlogct = s T a t dB t , c > given, (2-17) 

where s is constant and the volatility matrix at is restricted only to lie in the 
compact and convex set T, as in Example 2.2, corresponding to Peng's [30] notion 
of G-Brownian motion. Utility is defined by the standard aggregator, 

V t (c) = -E[- J u{c s )e-^ds | 

where the felicity function u is given by 

u(c t ) = {c t ) a /a, 0^ a < 1. 

Then the conditional utilities Vt (c) can be expressed in closed form. To do so, 
define q_ and a as the respective solutions to 

min tr (aa T ss T ) and maxtr (aa T ss T ) . (2.18) 
<rer v ' aer v ' 

(Ifd = 1, then T is a compact interval and a and a are its left and right endpoints.) 
Let P* be the measure on Q induced by P and X* , where 

XI = a T B t , for all t and cu; 
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define P** similarly using a and X**. They are worst-case (or minimizing) mea- 
sures in that 18 



Vo(c) 




/ a- l (c r ) a e-P T dT 
•Jo 



a~ l (cr) a e-P T dT 



if a < 
if a > 



(2.19) 



This follows from Levy et al. [21] and Peng [32], because u (c t ) = a^ 1 exp (a log c t ) 
and x i — > e ax /a is concave if a < and convex if a > 0. From (2.17), almost 
surely with respect to P^ T \ 



a 1 c t cxp <^ a 



s o T dB 1 



It follows that utility can be computed as if the volatility a t were constant and 
equal to a (if a < 0) or a (if a > 0). 

For t > 0, employ the regular conditionals of P* and P**, which have a simple 
form. For example, following (2.9), for every (t,u>), (P*)t is the measure on f2 
induced by the SDE 

' dX T = adB T , t<r <T 

X T = C0 T , < T < t 

Thus under (P*)?, B T - B t is N (0,oa T (r - t)) fort<r<T. Further, (P* 



is 



the worst case measure in if a < 0; similarly, (p**)^ i s the worst case measure 
in Vf if a > 0. Repeat the argument used above for t = to obtain 



Vt(c) 




if a < 
if a > 



and, by computing the expectations, that 

a-ic^-i^ _ e -7(T-*)) if a < o 
a^c^-^l - e-l^-^) if a > 



Vt(c) 



where 



— An L 2 T T A 

7 = p — -a s cr cr s, 7 = 



L 2 T T 

a s aa s. 

2 — 



18 That the minimizing measure corresponds to constant volatility is a feature of this example. 
More generally, the minimizing measure in V defines a specific stochastic volatility model. 
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Utility has a range of natural properties. Most noteworthy is that the pro- 
cess (V t ) satisfies the recursive relation (2.14). However, though such recursivity 
is typically thought to imply dynamic consistency, the nonequivalence of priors 
complicates matters. The noted recursivity implies the following weak form of 
dynamic consistency: For any < t < T, and any two consumption processes d 
and c that coincide on [0,t], 

[V t (c) > V t (c) q.s] V (d) > V (c) . 

Typically, (see Duflie and Epstein [7, p. 373] for example), dynamic consistency 
is defined so as to deal also with strict rankings, that is, if also V t (d) > V t (c) on a 
"non-negligible" set of states, then Vq (d) > Vq (c). This added requirement rules 
out the possibility that d is chosen ex ante though it is indifferent to c, and yet 
it is not implemented fully because the individual switches to the conditionally 
strictly preferable c for some states at time t. The issue is how to specify "non- 
negligible". When all priors are equivalent, then positive probability according 
to any single prior is the natural specification. In the absence of equivalence a 
similarly natural specification is unclear. In particular, as illustrated in [8], it is 
possible that d and c be indifferent ex ante and yet that: (i) d > c on [0, T] and 
they coincide on [0, t}; (ii) there exists t > 0, an event N t G T t , with P (N t ) > for 
some P EV, such that d T > c T for t < r < T and oo G A^ r . Then monotonicity of 
preference would imply that d be weakly preferable to c at t and strictly preferable 
conditionally on (t, N t ), contrary to the strict form of dynamic consistency. 19 The 
fact that utility is recursive but not strictly so suggests that, given an optimization 
problem, though not every time optimal plan may be pursued subsequently at 
all relevant nodes, under suitable regularity conditions there will exist at least one 
time optimal plan that will be implemented. This is the case for an example in 
[8], but a general analysis remains to be done. 



19 The reason that c' and c are indifferent ex ante is that Nt is ex ante null according to the 
worst-case measure for c. 
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A. Appendix: Uniform Continuity 

Define the shifted canonical space by 

& = {cu G C d ([t,T]) I cu t = 0}. 

Denote by B l the canonical process on fi*, (a shift of B), by P * the probability 
measure on such that B l is a Brownian motion, and by J 7 ' = {J-^}t<r<T the 
filtration generated by B l . 

Fix < s < t < T. For ui G VL S and u) G fi*, the concatenation of w and uj at t 
is the path 

(w <g> t w) T = w T l [S)t )(r) + (w t + o) r )l[ ti r](r), s < r < T. 

Given an J 7 !,- measurable random variable £ on fi s and w G f2 s , define the shifted 
random variable on fi* by 

f(Q) =^(w§ ( w), u) G ft*. 

For an ^-progressively measurable process (X T ) s < r < T , the shifted process (X* ,tJ ) t < r < T 
is ^-progressively measurable. 

Let (Os) 0<s<T be a process of correspondences as in Section 2.2. For each (t, uj) 
in [0, T] x ft, define a new process of correspondences ( < S) t s ' u ')t<s<T by: 

0^(w) 4 s ( w ® t u)), w G ft*. 

Then 

0*' w : ft* ~* M d x M dxd . 

The new process inherits conditions (i)-(iv) and (vi). The same is true for (v), 
which we define next. 

The following definition is adapted from Nutz [25, Defn. 3.2]. Say that (Q t ) is 
uniformly continuous if for all 5 > and (t, uj) G [0, T] xft there exists e(t, uj, 5) > 
such that if sup 0<s<t | u s — uj' s |< e, then 

r^e^iui) C ri e e*' w '(a>) for all (s,w) G [t,T] x ft*. 

The process (0*), and hence also 0, are fixed throughout the appendices. Thus 
we write V instead of V e . Define V° C V by 

V° = {P G V : 35 > t (w) G ri*e t (w) for all (t,u) G [0,T] x ft}. (A.l) 
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B. Appendix: Conditioning 



Theorem 2.6 is proven here. 

In fact, we prove more than is stated in the theorem because we prove also 
the following representation for conditional expectation. For each t G [0, T] and 
P EV, define 

V(t, P) = {P' G V : P' = P on T t }. (B.l) 
Then for each f G t G [0, T] and P G V, 20 

E[£ | T t ] = ess sup Epi[£ \ F t \, P-a.e. (B.2) 

p'er(t,p) 

Perspective on this representation follows from considering the special case 
where all measures in V are equivalent. Fix a measure P m P ■ Then the condition 
(B.2) becomes 

E[€ I J~t] — ess sup Epi[£ | J-'t], P -a.e., for every P <EV. 
p'er(t,p) 

Accordingly, the random variable on the right side is (up to Po -num ty) indepen- 
dent of P. Apply U PeV V(t, P) = V to conclude that 21 

E[£ | Ft] = ess sup Ep>[£ | Ft], Po-a.s. 

P'dV 

In other words, conditioning amounts to applying the usual Bayesian conditioning 
to each measure in V and taking the upper envelope of the resulting expectations. 
This coincides with the prior-by-prior Bayesian updating rule in the Chen- Epstein 
model (apart from the different convention there of formulating expectations using 
infima rather than suprema). When the set V is undominated, different measures 
in V typically provide different perspectives on any random variable. Accordingly, 
(B.2) describes E[£ \ Tt\ completely by describing how it appears when seen 
through the lens of every measure in V . 

20 Because all measures in V(t, P) coincide with P on J t , essential supremum is denned as in 
the classical case (see He et al. [14, pp. 8-9], for example). Thus the right hand side of (B.2) is 
defined to be any random variable £* satisfying: (i) £* is ^-measurable, Ep,[t; \ F t ] < £* P-a.e. 
and (ii) £* < £** P-a.e. for any other random variable £** satisfying (i). 

21 The Po-null event can be chosen independently of P by He et al. [14, Theorem 1.3]: Let H be 
a non-empty family of random variables on any probability space. Then the essential supremum 
exists and there is a countable number of elements (£„) of H such that esssup% = V£„. 
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Our proof adapts arguments from Nutz [25]. He constructs a time consis- 
tent sublinear expectation in a setting with ambiguity about volatility but not 
about drift. Because of this difference and because we use a different approach to 
construct the set of priors, his results do not apply directly. 

For any probability measure P on the canonical space ft, a corresponding 
regular conditional probability Pf is defined to be any mapping Pf : ft x Tt — > 
[0, 1] satisfying the following conditions: 

(i) for any u, Pf is a probability measure on (ft, Tt)- 

(ii) for any A G Ft, cu — > P^{A) is Tt- measurable, 
(hi) for any A G J= T , E p [l A \ F t \{u) = P?(A), P-a.e. 

Of course, Pf is not defined uniquely by these properties. We will fix a version 
defined via (2.9) after proving in Lemma B.2 that Pf defined there satisfies the 
conditions characterizing a regular conditional probability. This explains our use 
of the same notation Pf in both instances. 

If P is a probability on ft s and u G ft s , for any A G F T we define 

P^{A) = P?(u ® t A), 

where to <S>t A = {cu <g> t Cj \ uj G A}. 
For each (t,w) G [0,T] x Q, let 

e s (Q) = (fi s (Q),a s (Q)) G ri*e*' w (a)) for all (s,u) G [t,T] x ft*, 

where 5 > is some constant. Let X t,e = (X 1 /) be the solution of the following 
equation (under Pq) 

dX 1 / = n 8 {X*> 9 )ds + (TsiX^dBl Xt' e = 0, s G [t,T]. 

Then X t,e and Pq induce a probability measure P*' 61 on ft*. 

Remark 2. For nonspecialists we emphasize the difference between the preceding 
SDE and (2.9). The former is defined on the time interval [t,T], and is a shifted 
version of (2.3), while (2.9) is defined on the full interval [0, T\. This difference is 
reflected also in the difference between the induced measures: the shifted measure 
Pq G A (ft*) and the conditional measure Pf G A (ft). Part of the analysis to 
follow concerns shifted SDE's, random variables and measures and their relation 
to unshifted conditional counterparts. (See also Appendix A.) 
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Let V°(t,u) be the collection of such induced measures P t,e . Define 
deg(t, u, P t,e ) = 5*/2 > 0, where S* is the supremum of all 5 such that 

9,(Q) G ri'e*' w (o>) for all s and w. 

Note that at time t — 0, P°(0, u) does not depend on u and it coincides with P°. 

For each (t, u>) G [0, T] x f2, let V(t, u) be the collection of all induced measures 
P t,d such that 

e s (Q) = (fi s (Q),<T s (Q)) G 0*'"(u)) for all (s,w) G [f,T] x fi*. 
Note that V(0,lu) = V. 

Now we investigate the relationship between V(t,u) and . (Recall that for 
any P = P 9 in V, the measure P t w is defined via (2.9) and V% is the set of all such 
measures as in (2.10).) 

For any 9 = (//, a) G 0, (£, to) G [0, T] x f2, define the shifted process 9 by 

a (d>) = (/i s (£),<7 s (u))) 4 (/i^(cu), (u))) for (s,d>) G [t,T] x ft*. (B.3) 
Then s (u)) G 0*' w (cj). Consider the equation 

f dX s = £ s (X.)cfe + a s (X.)dBl, s G [t,T], 

\ x t =0. 

Under Pq, the solution X induces a probability measure P*' 6 * on Jl*. By the 
definition of P(t, w), P*' 9 " G P(t, u). 

Lemma B.l. {(P')*' w : P' G P^} = P(t,w). 

Proof. C: For any 9 = (//, cr) G 0, P = P e and (t,w) G [0,T] x Q, we claim 
that 

^pu>y,u _ pt,e 

where P t,e is defined through (B.4). Because B has independent increments under 
Pq, the shifted solution (X^ ,w ) t < s < T of (2.9) has the same distribution as does 
(X s ) t < s <T. This proves the claim. 

D: Prove that for any P*' 6 * G V(t,u), there exists G such that (where 
P = P e ) 

P? G P^ and (Pff'" = P tfi . 
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Each P 1 ' 9 in V(t, u) is induced by the solution X of (B.4), where 9 (u), defined in 
(B.3), lies in 0*' w (u)) for every Co G fl*. For any 9 G such that 

consider the equation (2.9). Under P , the solution X induces a probability P t w on 
Q. Because B has independent increments under P , we know that (Pf) t,u = P*' e . 
This completes the proof. ■ 

Lemma B.2. For any G and P = P e , {Pf : (i,u;) G [0,T] x fi} is a version 
of the regular conditional probability of P. 

Proof. Firstly, for any < ti < ■ ■ ■ < t n < T and bounded, continuous 
functions if and tp, we prove that 

E p [ V (B tlM , B tnM 'W(B tl B tn )} = E p [ip(B tlAt , B tnM )^ t ] (B.5) 
where t G [tk,tk+i) and, for any Co G Q, 

UCj) 4 P^ a [^(^(t 1 ), . . .,u>(tk)Mt) + • • -Mt) + Bl)}. 

Note that P*' 9 ~ on is induced by X (see (B.3)) under P *, and P = P 9 on Q 
is induced by X = X e (see (2.3)) under P . Then, 

MX(co)) = (B.6) 
. . .,X t „( U ),X t ( U ) + X tfc+1 (o>), . . .,X t (u) + X t »)]. 

Because B has independent increments under P , the shifted regular conditional 
probability 

P ^ = P *, Po-a.e. 

Thus (B.6) holds under probability Pq U . 

Because Pq' w is the shifted probability of (Po)£\ we have 

= (x t >), . . . ,x,»,x, fc+ », . . . ,x tn (co))} 

= E p ^(X tl (u),...,X tk (u),X tk+1 (u),...,X tn (u)) \T t ]{u), P -a.e. 
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Further, because P is induced by X and P , 



E p [<p(B tlAt ,.. 


■ , B tnA t)ipt\ 




= E p »[<p(X tlM ,. 


..,x tnAt )Mx)} 




= E p ^(X tlAt ,. 
= E p °[<p{X tlAt ,. 
= E p [<p(B tlAt ,.. 


..,X tnAt )E p ^(X tl ,... 




..,x tnAt )4>(x tl ,...,x t , 


j] 


■ ,B tnM )ip(B tl , . . . ,B tn ] 


>]■ 



Secondly, note that (B.5) is true and <p and (ti, . . . ,t n ) are arbitrary. Then 
by the definition of the regular conditional probability, for P-a.e. to G f2 and 

t e [t k , t k+1 ), 

= E^M&fa), . . .Mtk)Mt) + Bl k+V . . . ,u(t) + B l)l (B.7) 

where P 1 ^ is the shift of the regular conditional probability of P given (t, Co) G 
[0,T]xa 

By standard approximating arguments, there exists a set M such that P(M) = 
and for any u £ M, (B.7) holds for all continuous bounded function ip and 
(ti,...,t„). This means that for w ^ M and for all bounded J 7 ^- measurable 
random variables £ 

e 1 ' C = 

Then P 1 ^ = P'- e " P-a.e. By Lemma B.l, P 1 ^ = P 1 ' 9 = (P^)'- w , P-a.e. Thus If 
is a version of the regular conditional probability for P. ■ 

In the following, we always use Pf defined by (2.9) as the fixed version of 
regular conditional probability for P G V . Thus 

E p ^i = E p \i\T t \^), P-a.e. 

Because we will want to consider also dynamics beginning at arbitrary s, let 
< s < T, w G fl, and P G V(s,u>). Then given (t,u) G [s,T] x Q s , we can fix a 
version of the regular conditional probability, also denoted Pf (here a measure on 
VI s ), which is constructed in a similar fashion via a counterpart of (2.9). Define 

V?(s,Q) = {P? : P G V(s,Q)} and 

P t ^( S , W ) = {Pr :PeV°(s,co)}. 
In each case, the obvious counterpart of the result in Lemma B.l is valid. 
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The remaining arguments are divided into three steps. First we prove that 
if £ G UCb(Q) and if the set V is replaced by V° defined in (A.l), then the 
counterparts of (B.2) and (2.12) are valid. Then we show that V° (resp. V (t,u)) 
is dense in V (resp. V(t,u)). In Step 3 the preceding is extended to apply to all 
£ in the completion of UCb(Q). 

Step 1 

Given £ G UC b (Q), define 

v?(u) = sup E p ^, {t,u) e[0,T]xQ. (B.8) 

Lemma B.3. Let < s < t < T and Co G Q. Given e > 0, there exist a sequence 
(uj l )i>x in il s , an J 7 ^ -measurable countable partition (P l )j>i ofVL s , and a sequence 
(P l )i>i of probability measures on Q* such that 

(i) || uj — Cj 1 \\[ s ,t]= sup s<r<t | u) T — Cj 1 t \< e for all oj G E l ; 

(ii) P i G P°(t, b ® s to) for" all w G E l and inf deg(t, u> ® s to, P l ) > 0; 

ujeE' 

(iii) ® s w*) < E pi [^®^} + e. 

Proof. Given e > and Cj G by (B.8) there exists P(u) G 7>°(t, cu ® s Cj) 
such that 

u t °(w ® s Oj) < E p ^[^' a ] + e. 
Because (0 t ) is uniformly continuous, there exists e{uj) > such that 

P(Cj) G V°(t,u)® s u') for all J G £(e(cD),u>) and inf deg(t,u<g) s u', P(Cj)) > 

w'eB(e(w),w) 

where B(e,Cj) = {u/ G fi s ||| u/ — a) ||[ s ,t]< e} is the open || ■ ||[ S;t ] ball. Then 
{B(e(Cj),Cj) | a) G f2 s } forms an open cover of Q s . There exists a countable 
subcover because Q s is separable. We denote the subcover by 

B i = B(e(Cj i ),Cj i ), i = l,2,... 

and define a partition of Q s by 

E 1 = B 1 , E l+1 = B l+1 \(E l U • • • U E l ), i > 1. 

Set = P(w i ). Then (i)-(iii) are satisfied. ■ 

For any A G J 7 ^, define 

= G ft* | u <g) t UJ G A}. 
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Lemma B.4. Let < s < t < T, Co G and P G V°(s,u). Let (^)o<i<JV 
be a finite J- -measurable partition of Q s . For 1 < % < N, assume that P l G 
V°(t, Q ® s u) for all u G E i and that inf deg(t, Co ® s u, P l ) > 0. Dehne P by 

N 

P{A) 4 P(A n P°) + ^S p [P i (A t ' w )l^(o;)], A G J^. 

i=i 

Tien: (ij P G P°(s,w). 

(ii) P = P on J 7 /. 

(Hi) pt," = ptv p- a .e.[uj] onE°. 

(iv) P^ = P l P-a.e.[u\ on E\l<i< N. 

Proof, (i) Let 9 (resp. 9 l ) be the J 75 (resp. J 7 ') -measurable process such that 
P = P 9 (resp. P l = P 0t ). Define by 

N 

e T (u>) ± e T (u)i [s>t) (r) + [e T (u)i E o(u) + ^>')i^H]i [tiT] (r) (b.9) 

i=i 

for (r,w) G [s,T] x Then P e ~ G P°(s,u>) and P = P s on 

(ii) Let A G J 7 /. We prove P(A) = P(A). Note that for to G if to G A, 
then A*-" = Q*; otherwise, = 0. Thus, P'^A^) = l A (oo) for 1 < % < N, and 

N N 

p{A) = P{A n £°) + 5> p [uHi Ei H] = $> P L4 n tf] = p(A). 

i=l i=0 

(iii) -(iv) Recall the definition of P t w by (2.9). Note that P = P § where is 
defined by (B.9). Then it is easy to show that the shifted regular conditional 
probability P t,w satisfies (iii)-(iv). ■ 

The technique used in proving Nutz [25, Theorem 4.5] can be adapted to prove 
the following dynamic programming principle. 

Proposition B.5. Let < s < t < T, £ e UC b {Q) and define v° t by (B.8). Then 

v° s {oo) = sup E p '[(v Q t ) s ^} for all ooeQ, (B.10) 
p'ev°(s,u>) 

v° s = esssup E p '[v Q t | J 7 ,] P-a.e. for all P G V°, (B.ll) 

P'er°(s,p) 

and 

v° s = esssup E p '[i | J 7 ,] P-a.e. for all P G V°. (B.12) 

p'ev°(s,p) 
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Proof. Proof of (B.10): First prove <. Let to E f2 and u E £l s , by Lemma 
B.l, 



{(P')*' w | P' E V^(s,Q)} = V (t,u® s u). 



For P E V°(s,Cu) 



< sup E p '[e^} ( B13) 

P'eP°(t,a)® 3 a;) v ' ' 

= Vt(u® s u). 

Note that i>°(u> <8> s w) = (f°) s,a; (w). Taking the expectation under P on both sides 
of (B.13) yields 

E P C'° < E p [(v°) s >*}. 

The desired result holds by taking the supremum over P E V°(s,ui). 

Prove >. Let 5 > 0. Because *fi is a Polish space and (v^) s,ul is J-^-measurable, 
by Lusin's Theorem there exists a compact set G E T% with P(G) > 1 — 5 and 
such that (vtY'^ is uniformly continuous on G. 

Let e > 0. By Lemma B.3, there exist a sequence (uJ l )i>i in G, an P/- 
measurable partition (P*)j>i of G, and a sequence (P l )j>i of probability measures 
such that 

(a) || oj — Cj 1 ||[ S) t]< e for all uj E E % \ 

(b) i* E V°(t, Q ® s to) for all u E E i and inf deg(t, w ® s w, P*) > 0; 

(c) ® s Cj 1 ) < E pi [^®^} + e. 

Let 

Ajv^^U-'-Uf?^ N>1. 

For P G P°(s,u>), define 

V°(s, u, t, P) 4 {P' g P°(s, w) : P' = P on P/}. 

Apply Lemma B.4 to the finite partition {P 1 , . . . , E N , A C N } of VI s to obtain a 
measure P N E V°(s,oj) such that P N E V°(s,u,t,P) and 

p t,u, = f P^forwG^, 
* " \ P* for w G P*, 1 < i < N K > 

Because and £ are uniformly continuous on G, there exist moduli of 

continuity p^tY<*\G) (.) and (.) such that 

| (v^(u) - («?)•■>') |< w - a/ || M ), 
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Let 00 e E i for some 1 < i < N. Then 

(u t °) s '°M 

< K°)^(c£; J )+p ( ^ 0)S " |G) (e) 

< E P> [e ^ ]+e + p ((^\G) {e) 

< £j*[£t,S>®. W ] + p (0( e ) + e + p (K )'' o |G)( e ) l B - i5 ) 

= £ p !T [£* + p (0 ( e ) + e + p(W) s '*\G) ( e ) 

= | + p (?) (e) + e + p (( ^ 0)3,Q|G) (e) for P-a.e. w G £*. 

These inequalities are due respectively to uniform continuity of v®, Lemma B.3(iii), 
uniform continuity of £, equation (B.14), and the fact that P/v G V(t, P). Because 
P = Pn on J 7 /, taking the P-expectation on both sides yields 

E p [(v°) s '"1 An ] < E p »[^1 An ] +P^(e) + e + p^\G) {e y 

Note that £ G UC b (Q) and P N (G\A N ) = P(G\A^) ->■ as N ->■ oo. Let 
A" — >• oo and e — >■ to obtain that 

P P [K°) S '*1 G ] < sup E p '[C' Q l G }- 
p'ev°(s,u!,t,p) 

Because 5 > is arbitrary, similar arguments show that 

E p [(v?) 8fi) ]< sup E P 'C' Q < sup E p 'i s ^ = v°(u). 
P'eV°(s,o,t,P) P'eV°(s,o) 

But P G V°(s,lu) is arbitrary. This completes the proof of (B.10). 
Proof of (B.ll): Fix P G V°. First we prove that 

u° < ess sup P p '[£ | 7i] P-a.e. (B.16) 

P'er°(t,p) 

Argue as in the second part of the preceding proof, specialized to s = 0. Conclude 
that there exists P/v G V°(t,P) such that, as a counterpart of (B.15), 

v°(u) < E p "\i | F t ](u) + p«)(e) + e + p^°l G )(e) for P-a.e. to G A*. 

Because P = py on J^, as A" — > oo and 5 — > 0, one obtains (B.16). 

Now prove the inequality < in (B.ll). For any P' G P°(s,P), we know that 
(P')t^ e V°{t,u). From (B.10), 
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Taking the conditional expectation on both sides yields E p '[^ | J 7 ,] < E p '[v^ \ 
J 7 ,] P'-a.e., hence also P-a.e. Thus 

v® < ess sup E p '[£ | J 7 ,] < ess sup E p '[v® | J 7 ,] P-a.e. 

P'eP°( s ,P) ' P'eP°(s,P) 

Thirdly, we prove the converse direction holds in (B.ll). For any P' G 
V°(s,P), by (B.10) we have 

v°M>EW'~[(v°)'^=E'"[v°\r.](u) 

P'-a.e. on T s and hence P-a.e. 

Equation (B.12) is implied by (B.ll) because v% — £. ■ 



STEP 2 

Refer to the topology induced on A (fl) by bounded continuous functions as 
the weak-convergence topology. 

Lemma B.6. (a) V° is dense in V in the weak convergence topology. 

(b) For each t and ui, V°(t,u) is dense in V(t,u) in the weak convergence 
topology. 

Proof, (a) Let P e " G V° and P e G V, and define 

o e = ee° + (i-e)e, 

where < e < 1. By Uniform Inferiority for (6 t ), there exists 5 > such that 
t e (w) G ri 5 e t (u) for all t and w. Thus P r G V°. 

By the standard approximation of a stochastic differential equation (see Gih- 
man and Skorohod [12, Thm 3.15]), as e — > there exists a subsequence of X e \ 
which we still denote by X e \ such that 

sup | Xf -X* K Po-a.e. 

0<t<T 

The Dominated Convergence Theorem implies that P^ — > P e . 

(b) The proof is similar. ■ 

For any 9 = (//, a) G O, (t,u) G [0,T] x Q, define the shifted process 0, the 
process X and the probability measure P 1 ' 9 exactly as in (B.3) and (B.4). As 
noted earlier, P t,e G V(t,u>). 
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Recall that for any P in V, the measure Pf is defined via (2.9); Vf is the set 
of all such measures (2.10). By construction, 

P? ({Co G Q : u s = u s s G [0, t}}) = 1. 

We show shortly that Pf is a version of the regular conditional probability for P. 
Given £ eUC b (Q), define 

v t (u)= sup E p ^, (t,w) G [0,T] x ft. (B.17) 

Pev(t,u) 

Lemma B.7. For any (t,u) G [0,T] x Q and £ G C/C 6 (ft), we have 

v t (u>)=v°(u>), (B.18) 
= sup E p £, (B.19) 

and 

v t = ess sup £ p '[£ | 7i] P-a.e. for all P eV. (B.20) 

P'eV(t,P) 

Furthermore, for any < s < t < T, 

v s (u) = sup E p '[{v t ) s ^} for all ueVt, (B.21) 

P'ev(s,u>) 



and 

v s = ess sup £ p '[^ | F 8 ] P-a.e. for all P eV. (B.22) 

P'eP(s,P) 

Proof. Proof of (B.18): It is implied by the fact that V°(t,u) is dense in 
V(t,u). 

Proof of (B.19): By the definition of v t (cu), we know that 

v t (u)= sup E p ^- 
PeP(t,ui) 

By Lemma B.l, 

{{P'f" \P' eVf} = V(t,u). 

Thus 

v t {u) = sup E pt '"^ = sup E P C 
PeP? PeP? 
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Proof of (B.20): Fix P E V. For any P' E V(t,P), by Lemmas B.l and B.2, 
(P')t," e -p(t,u). By the definition of v t (co), 

P'-a.e. on T t and hence P-a.e. Thus 

v t > ess sup E p [£ | Ft] P-a.e. 

p'ev(t,p) 

Now we prove the reverse inequality. By (B.18), v t (u>) = Vt(oo). Then, using 
the same technique as in the proof of Proposition B.5 for the special case s = 0, 
there exists Pjv E V(t,P) such that, as a counterpart of (B.15), 

v t (u) < E p "[£ | F t ](u) + p«)(e) + e + p^l G )(e) 

< sup£ p ~[£ | + p«)(e) + e + p(^l G )(e) 

v 

for P-a.e. u E Aw- Let iV — >■ oo to obtain that, for P-a.e. u E G, 

v t (u>) < esssup£ p '[£ | ^]H + p«)( e ) + e + p^l G )(e). 
P'eV(t,P) 



Let e — )■ to derive 



t>i < ess sup [£ | J^], P-a.e. on G. 
P'ev(t,p) 

Note that G depends on <5, but not on e. Let 5 — > and conclude that 

v t < ess sup E p [£ | Ft], P-a.e. 
p'ev(t,p) 



Proof of (B.21) and (B.22): The former is due to (B.10) and the fact that P°(t, w) 
is dense in P(t,u). The proof of (B.22) is similar to the proof of (B.12) in Propo- 
sition B.5. ■ 



Now for any £ E UCf,(Q), we define conditional expectation by 

E[Z\F t }(u)±v t (uj). 
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STEP 3 

Thus far we have defined E[£ \ F t ](ou), for all t and u>, for any £ G UC^iVL). 
Now extend the operator E[- \ Ft] to the completion of UCb(Q). 

Lemma B.8 (Extension). The mapping E[- \ F t ] on UCb{^l) can be extended 
uniquely to a l-Lipschitz continuous mapping E[- \ F t ] : L 2 (f2) — > L 2 (*fi). 

Proof. Define L 2 (*f2) to be the space of J^-measurable random variables X 
satisfying 

|| X ||4 (E[\ X | 2 ])5 = (sup£ p [| X | 2 ])5 < oo. 

Obviously L2(*ft) C L2(*fi). 

(i) We prove that E[- \ F t ] can be uniquely extended to a l-Lipschitz continuous 
mapping 

E[- | F t ] : L 2 (fi) ->■ L2(*Q). 
For any £ and 77 in C/C 6 (Q), 

i Eir i J*] - % i f] i 2 < (ski r - e i) i h) 2 < e[\ e - e ri jy, 

where the first inequality follows primarily from the subadditivity of E[- \ Ft], and 
the second is implied by Jensen's inequality applied to each PinV. Thus 

ii e[? i Ft] - m i n ii 

= (E [l %' I H - E\i I Ft] | 2 ]) V2 
r - i \ 1/2 

< (e [e[\ e'-eri^]j) 
= (^D r-er]) 1/2 =n c-eii, 

where the second equality is due to the 'law of iterated expectations' for integrands 
in UCb{Q) proven in Lemma B.7. 

As a consequence, E[£ \ F t ] € L 2 ('f2) for £ and rj in UCb(Q), and £■[• | .F t ] 
extends uniquely to a l-Lipschitz continuous mapping from L 2 (Q) into L 2 ('f2). 

(ii) Now prove that E[ \ F t ] maps L 2 (fl) into L 2 ('f2). 
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First we show that if £ G UCb(Q), then £[£ | F t ] is lower semi continuous: Fix 
u G Q. Since £ G UC b {Q), there exists a modulus of continuity such that for 
all u' G Q and Co G fi* 

ieH-e(w / )l<P (€) (l|w-w' ll[o,T]), and 
|^(w)-^(w)|<p«>(||a;-a/ 

Consider a sequence (o> n ) such that || w — cu n ||[o,t]~* 0. For any P G V(t,u), by 
the Uniform Continuity assumption for (Of), we know that P G V(t,u n ) when n 
is large enough. Thus 

liminf-y+fcUr, 



n— >oo 

= lim inf sup i? p 



> lim inf sup £ p '£^ - w - w n || [0)t] 

n-foo y \p> eV{t!<jJn) J 
sup E P '£*^ > E P ^ . 

p'ev(t,u n ) 

Because P G V(t,u) is arbitrary, this proves that liminf n >oo v t(u n ) > v t (u), 

which is the asserted lower semicontinuity. 

Next prove that any bounded lower semicontinuous function / on t Q is in 
L 2 ( f fl): Because t Q is Polish, there exists a uniformly bounded sequence f n G 
C fe ( t fi) such that /„ t / f° r & U ^ G fl By Gihman and Skorohod [12, Theorem 
3.10], P is relatively compact in the weak convergence topology. Therefore, by 
Tietze's Extension Theorem (Mandelkern [24]), C 6 (*Q) C L 2 (*fi), and by Denis et 
al. [4, Theorem 12], sup PeV E p (| / - /„ | 2 ) ->■ 0. Thus / G L 2 ('ft). 

Combine these two results to deduce that E[£ \ Ft] G L 2 (*f2) if £ G UCf,(Q). 
From (i), {E[£ \ J^t] '■ £ £ -^ 2 (^)} i s contained in the || • ||-closure of {E[£ \ J-" t ] : 
£ G C/C 6 (^)}- But {E[£ | 7i] : £ G £/C 6 (^)} is contained in L^'ft), which is 
complete under || ■ ||. This completes the proof. ■ 

Proof of (B.2): Fix P G V and X G L 2 ^). Given e > 0, there exists X G C/C b (Q) 
such that 

|| E[X | 7i] -E[X | 7i] ||<|| X-X ||< e. 
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For any P' G V(t,P), 

E P '[X\ Tj\-E[X\ F t ]_ 
= E P '[X-X\ F t ] + (E P '[X | F t ] - E[X | Ft]) + (E[X \ F t ] - E[X | F t ]). 

(B.23) 

From Karatzas and Shreve [20, Theorem A. 3], derive that there exists a se- 
quence P n G V(t,P) such that 

ess sup E P '[X | Ft] = lim E Pn [X \ F t ] P-a.e. (B.24) 

P'ev(t,p) n ^°° 

where P-a.e. the sequence on the right is increasing in n. Then by Lemma B.7, 
E[X | F t ] = ess sup E P '[X | F t ] = lim E P "[X \ F t ] P-a.e. (B.25) 



p'ev(t,p) 



Denote L 2 (fi, J" r , P) by £ 2 (P). Compute L 2 (P)-norms on both sides of (B.23) 
to obtain, for every n, 

|| E P "[X_\ Ft] - E[X \ Ft]h HP) 

< \\X-X \\ LHPn) + \\E P -[X | Ft] - E[X | Ft] Wlhp) + || E[X \ F t ] - E[X \ F t ] || £2(P) 

< || E P "[X | 7i] - E[X | Ji] || L2(P) +2e 

By (B.25), 

lim sup || E Pn [X | Ft] - E[X \ F t ] \\< It. 

n— >oo 

Note that e is arbitrary. Therefore, there exists a sequence P n G P(t, P) such that 
P/"[X | 7i] ->• E[X | J 7 *], P-a.e., which implies that 

E[X | 7i] < ess sup E P '[X \ F t ]. (B.26) 

P'ev(t,p) 

As in (B.24), there exists a sequence P^ G P(t, P) such that 

ess sup E P '[X | 7i] = lim E P -[X \ F t ] P-a.e. 
p'ev(t,p) n ~ + °° 

with the sequence on the right being increasing in n (P-a.e.). Set 
A n 4 {P p A[x | 7i] > E[X | By (B.26), P-a.e. 

< (P P "[X I 7i] - P[X I JiDU, / ess sup P p [X | 7i] - P[X | F t ]. 

P'€T(t,P) 
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By (B.23) and (B.25), P-a.e. 

E P -[X | F t \ - E[X | F t \ < E K [X-X\ F t \ + (E[X \ F t \ - E[X \ F t \) 

Take L 2 (P)-norms to derive 

|| ess sup E P '[X | F t \ - E[X \ F t \ || L2(P) 
P'eP(t,P) 

= lim || (E P *[X | F t ] - E[X | F t \)l An || L2(P) 

n— >oo 

< lim sup \\X-X || L 2 ( p,) + || E[X | F t \ - E[X \ F t \ || L2(P) 

< 2e. 

This proves (B.2). 

Proo/ 0/ It is sufficient to prove that, for < s < t < T, P-a.e. 

ess sup E p ' [X I F s ] 

P'ev(s,p) 

= ess sup E p [ ess sup E p [X \ Ft] \ F s ] 

P'ev(s,p) P"eV(t,P') 



(B.27) 



The classical law of iterated expectations implies the inequality < in (B.27). Next 
prove the reverse inequality. 

As in (B.24), there exists a sequence P" G V(t,P') such that P'-a.e. 

lim E p ™ [X I F t \ | ess sup E P "[X | F t \ 

ra->oo P"eV(t,P') 

Then 

E p '[ ess sup E P "[X \ F t \ \ F s \ 

P"&V(t,P') 

= lim E p ™[X I F s \ 

< esssupP p [X I F s ] P-a.e. 

PeP(s,P) 

This proves (2.12). 

Proof properties (i)-(iv) is standard and is omitted. This completes the proof 
of Theorem 2.6. ■ 
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C. Appendix: Proofs for Utility 

Proof of Theorem 2.7: Part (a). Consider the following backward stochastic 
differential equation under E: 

V t = E[i + f{c a ,V 8 )ds\Tt\, te[0,T], 

where £ G L 2 (Vt T ) is terminal utility and c G D. (Equation (2.13) is the special 
case where £ = 0.) Given V G M 2 (0,T), let 

A t (V) = E[Z + £ f(c 8 ,V 9 )ds\T t ], te[0,T}. 

We need the following regularity property of A. 

Lemma C.l. A is a mapping from M 2 (0,T) to M 2 (0,T). 

Proof. By assumption (ii) there exists a positive constant K such that 

\f(c s ,V s )-f(c s ,0)\<K\V s \, se[0,T}. 

Because both V and (f(c s , 0)) < s <t are in M 2 (0,T), we have (f(c s , V s )) < s <t G 
M 2 (0,T). Thus 

E[\t + tff( Cs ,V s )dsf} 

< 2E[\Z\i+(T-t)tf\f( Cs ,V s )\ids} 

< 2E[\ £ | 2 ] + 2(T - t)E[J t T | f(c s , V a ) | 2 ds] < oo, 

rp _ ^ 

which means that (f + J t f(c s , V s )ds) G L 2 (9). Argue further that 

E[\A t {V) | 2 ] =E[(m + J t T f(c.,V.)ds\r t ])*] 
<E[E[\£ + f? f(c s ,V s )ds\ 2 \F t ]] 
= E[\£ + jf f(c s ,V s )ds f\] < oo. 

Finally, (£ / \ A t \ 2 dt)^ < ( [ E[\ A t \ 2 ]dt)^ < oo and A G M 2 (0,T). ■ 
Jo ./o 
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For any V and V G M 2 (0,T), by Theorem 2.6 the following approximation 
holds: 

\ A t (V) - A t (V) | 2 



= (E[f t (f( c s,V s )-f(c s X))ds\T t ]y 

< E[(tf\f(c s ,V s )-f(c s X)\ds) 2 \F t } 

< (T — t)K 2 E{ 

< LE{[ | V a -V 8 \ 2 ds | T t ], 



(C.l) 



where K is the Lipschitz constant for the aggregator and L = TK 2 . (The first 
inequality is due to the classical Jensen's inequality and (B.2).) Then for each 
re[0,T], 



E 



A t (V) - A t (V) | 2 dt] 



<LE[^ E[J 



v s -v;\ 2 ds\F t }dt} 



<L E[ | V, - V 8 | 2 ds]dt 



<L(T-r)E[ \V 8 -Vj\ 2 ds]. 

J r 

Set 5 = ± and n = max{T - 5, 0}. Then, 

E[ f | A t (V) - A t (V) | 2 dt] < \e\ f | K - VI | 2 dt], 

which implies that A is a contraction mapping from M 2 (n,T) to M 2 (n,T) and 
there exists a unique solution (14) G M 2 (ri,T) to the above BSDE. Because 5 is 
independent of t, we can work backwards in time and apply a similar argument 
at each step to prove that there exists a unique solution (V t ) G M 2 (0,T). 

Part (b). Uniqueness of the solution is due to the contraction mapping property 
established in the proof of (a). 
By Theorem 2.6, 



— E 
= -E 

= —E[E 
= -E\- 



j; f( Cs ,v s )ds - v T i F t ] 

J7 f(c„ V s ) ds + e\— f(c s , V s ) ds | J~- T 



n 



ft f(c s ,V s )ds | J- T 
f(c s ,V s )ds\F t ] = V t . 
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